<212> DNA
<400> 1

cgacctggcc gccggccgct cctccgcgcg ctgttccgca cttgctgccc tcgcccggcc 60 
cggagcgccg ctgccatgcg gctggcgctg ctctgggccc tggggctcct gggcgcgggc 120 
agccctctgc cttcctggcc gctcccaaat atagccctgc tgtcgattcc ctcagtactg 180 
tcttggggtg tcctgggacc tgcaggtggc actgaggagc agcaggcaga gtcagagaag 240 
gccccgaggg agcccttgga gccccaggtc cttcaggacg atctcccaat tagcctcaaa 300 
aaggtgcttc agaccagtct gcctgagccc ctgaggatca agttggagct ggacggtgac 360 
agtcatatcc tggagctgct acagaatagg gagttggtcc caggccgccc aaccctggtg 420 
tggtaccagc ccgatggcac tcgggtggtc agtgagggac acactttgga gaactgctgc 480 
taccagggaa gagtgcgggg atatgcaggc tcctgggtgt ccatctgcac ctgctctggg 540 
ctcagaggct tggtggtcct gaccccagag agaagctata ccctggagca ggggcctggg 600 
gaccttcagg gtcctcccat tatttcgcga atccaagatc tccacctgcc aggccacacc 660 
tgtgccctga gctggcggga atctgtacac actcagacgc caccagagca ccccctggga 720 
cagcgccaca ttcgccggag gcgggatgtg gtaacagaga ccaagactgt ggagttggtg 780 
attgtggctg atcactcgga ggcccagaaa taccgggact tccagcacct gctaaaccgc 840 
acactggaag tggccctctt gctggacaca ttcttccggc ccctgaatgt acgagtggca 900 
ctagtgggcc tggaggcctg gacccagcgt gacctggtgg agatcagccc aaacccagct 960 
gtcaccctcg aaaacttcct ccactggcgc agggcacatt tgctgcctcg attgccccat 1020 
gacagtgccc agctggtgac tggtacttca ttctctgggc ctacggtggg catggccatt 1080 
cagaactcca tctgttctcc tgacttctca ggaggtgtga acatggacca ctccaccagc 1140 
atcctgggag tcgcctcctc catagcccat gagttgggcc acagcctggg cctggaccat 1200 
gatttgcctg ggaatagctg cccctgtcca ggtccagccc cagccaagac ctgcatcatg 1260 
gaggcctcca cagacttcct accaggcctg aacttcagca actgcagccg acgggccctg 1320 
gagaaagccc tcctggatgg aatgggcagc tgcctcttcg aacggctgcc tagcctaccc 1380 
cctatggctg ctttctgcgg aaatatgttt gtggagccgg gcgagcagtg tgactgtggc 1440 
ttcctggatg actgcgtcga tccctgctgt gattctttga cctgccagct gaggccaggt 1500 
gcacagtgtg catctgacgg accctgttgt caaaattgcc agctgcgccc gtctggctgg 1560 
cagtgtcgtc ctaccagagg ggattgtgac ttgcctgaat tctgcccagg agacagctcc 1620 
cagtgtcccc ctgatgtcag cctaggggat ggcgagccct gcgctggcgg gcaagctgtg 1680 
tgcatgcacg ggcgttgtgc ctcctatgcc cagcagtgcc agtcactttg gggacctgga 1740
gcccagcccg ctgcgccact ttgcctccag acagctaata ctcggggaaa tgcttttggg 1800 
agctgtgggc gcaaccccag tggcagttat gtgtcctgca cccctagaga tgccatttgt 1860
gggcagctcc agtgccagac aggtaggacc cagcctctgc tgggctccat ccgggatcta 1920 
ctctgggaga caatagatgt gaatgggact gagctgaact gcagctgggt gcacctggac 1980 
ctgggcagtg atgtggccca gcccctcctg actctgcctg gcacagcctg tggccctggc 2040 
ctggtgtgta tagaccatcg atgccagcgt gtggatctcc tgggggcaca ggaatgtcga 2100 
agcaaatgcc atggacatgg ggtctgtgac agcaacaggc actgctactg tgaggagggc 2160 
tgggcacccc ctgactgcac cactcagctc aaagcaacca gctccctgac cacagggctg 2220 
ctcctcagcc tcctggtctt attggtcctg gtgatgcttg gtgccagcta ctggtaccgt 2280 
gcccgcctgc accagcgact ctgccagctc aagggaccca cctgccagta cagggcagcc 2340 
caatctggtc cctctgaacg gccaggacct ccgcagaggg ccctgctggc acgaggcact 2400 
aaggctagtg ctctcagctt cccggccccc ccttccaggc cgctgccgcc tgaccctgtg 2460
tccaagagac tccagtctca ggggccagcc aagcccccac ccccaaggaa gccactgcct 2520 
gccgaccccc agggccggtg cccatcgggt gacctgcccg gcccaggggc tggaatcccg 2580 
cccctagtgg taccctccag accagcgcca ccgcctccga cagtgtcctc gctctacctc 2640 
tgacctctcc ggaggttccg ctgcctccaa gccggactta gggcttcaag aggcgggcgt 2700 
gccctctgga gtcccctacc atgactgaag gcgccagaga ctggcggtgt cttaagactc 2760
cgggcaccgc cacgcgctgt caagcaacac tctgcggacc tgccggcgta gttgcagcgg 2820 
gggcttgggg aggggctggg ggttggacgg gattgaggaa ggtccgcaca gcctgtctct 2880 
gctcagttgc aataaacgtg acatcttgga aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2940 
aaaaaaaaaa aaaaaaaaaa aaaaaaaa 2968 
Pro
<210> 3

<211> 17138 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> (1)...(17138)
<223> n = A,T,C or G 

<400> 3 

ttgggtgacc ctgggcagtg atcacatctc caagcatcag ttttctcacc tgaaaaaaag 60 
gagatgataa taacactatc tgccttacat gacaattgaa ttgaattttt tttttttttt 120 
tgagactaag tctcactctg tcgcccaggc tggagtgcag tggcgtgatc ttggctcact 180 
gcaacctcca cctccccagt tcaagcgatt ctcgtgcctc agcttcccga gtagctggga 240 
ttacaggcac acactaccac gcccggctaa tttagaattg aaataattta tgtacagtat 300 
cttagtacag gacctgacat tataaacaat gagtggcagc cattcttatt taatcagtcc 360 
taacaaagtt cataaaagtg agactgtgtt tgcttagctt tttccctagg gcctggatac 420 
ccccagcccc catgacacac aataggggcc aaatgaatgt gttgtgaaaa aatgaaaaac 480 
aaaaaacaaa aaagaacatg ctgggattcc ttgacagggt cgtgaagcaa actgaatgtg 540 
aatgcacaga tggaaatgtg ccagacagtc attccaagca gaatgtgcaa agactcagtc 600 
cacagggaat gcgaagtgcc agggctagtc tcaggagaaa ctggctcaga agagacagct 660 
ctcagggagg gctaaagtag gaaagaggct agaaagggac caggtgaggg aaggctctga 720 
aggccaagcc caagagttct gcctgtctgg caggcagcag ggcctctgga gtttcttggg 780 
caaagagtgg ctgcttcctg ggtaaggtgg cctgtggaaa atccctgaca actgtgtaga 840 
gacatgtcgt gagggatggc agggagcata gtgaactagg tttgtggttt ggaatcaggg 900 
cccctggggt ccagccaagt tggattgttt actatctgtg tgactttgag agtcacttca 960 
cctttctcaa ctgtaaagtg gggatagcaa cagtgatagt cgatctggcc tgctcacttc 1020 
tcagcctcac tgtgagaacc aaataagatg atttacagga aagtgcaaat gagagttgtg 1080 
gctgatatcc gcttggagag agcctggagg gtgcatcctc ccattctcca tcacagagtt 1140 
ggggagggag gcaccctcgc cctccagggg tttcctttgt ccaacccagc ctcctccaac 1200 
acgcgggaat tgtcaggcct ggcgacttca gacaggaaac gctgtccagt tccccttctt 1260
tcccgcctcg ctcccgggct ggcgctaacg cccacctccc aacagcgcca cccgctggcg 1320
gatatcctgc accgcggctg cccgctcctg cgccgctggc tgtgccggcg ctgcgtggtg 1380
tgccaggcac ccgagacgcc cgagtcctac gtgtgccgga cgctggactg cgaggccgtg 1440
tactgctggt cgtgctggga cgacatgcgg cagcggtgcc cggtctgcac gccccgcgaa 1500
gagctctctt cctccgcctt tagtgacagc aacgacgaca ctgcctacgc ggggtgaaga 1560
ggcgtcctgc tcgctcttcc gcaccgtcct tcccggttaa taaaatgccc tgtacgcttc 1620
acgtgggtcg gggactgggg tgagccgcgc actgcctcgc ctgcagtcgg gaaagcctgc 1680
ccgcccgacc tctccgagcc aggccgcgca caggaggcag ggaggccgcg aagctactag 1740
ggaggggtcc ggacctggcg ccgggtgaag gcgcgccgcc caagccggtc ggaccgggca 1800
ccggctccca ctccgcacag ttgcggggaa gcggtagcgc tgagcagcgc gggcgtagtg 1860
ggcggtgtcc ccgctcccga ggcacccggc gcgcagcggg gcgggctttg ccgggggcgg 1920
agcttggctt ggggccgggt gggagggggc gggccggggc ggggcctggt ggccgcgcgg 1980
cgctgctggg ttctccgagg cgacctggcc gccggccgct cctccgcgcg ctgttccgca 2040
cttgctgccc tcgcccggcc cggagcgccg ctgccatgcg gctggcgctg ctctgggccc 2100
tggggctcct gggcgcgggc agccctctgc cttcctggcc gctcccaaat ataggtgagt 2160
cctccgcctg gagtgggtcg gggggcggac tgggagggag gtgcaggaaa gtcggaaggc 2220
attagggtaa tggggccgga cggagaccct gggagagccc agccagagcg cggcccgccc 2280
tggtccgctg tcctgggcct agggcccggt gacttggcga tggggtgaaa agagaaggag 2340
gggggatgcc ggcgccccct gcctcctgcc tggtcatcct ctgcgcggtc cctgcggaca 2400
ctttcaggct caggtaccag gtaccgaggg gcctgtccag cgccacttca agatcgtgat 2460
gagagggtcg ctgctcccca ggactggcat cttcgctgct ctggggccta gctaaccgtt 2520
ccacccggtg ccagggcgct gagcgggcat ggcttgtagg gtttagtgaa gaggattctc 2580
tctagcctct attccaggcc tggggccgcc aggcactcct caccctggtg ctgttgccac 2640
cagtgcctgg ccgagcggga ggggcccgag atgagccagg agaagggaga attggccagg 2700
aaagaggctg ggacaccaac tcctccttgg aactttcact tcccgctgct gtcttgggcc 2760
gggaccgaga gggcaggcgc gggtggagtg tccggaggag agagggccat tgtgtgttgg 2820
gggggtgggg ggtgctcgag gaggaagcag aggctgtagg cagcgggtgt gcctgactgg 2880
gcatgagggt gtttagggag gtgggggtgt ttgcactgct cacccagaaa tgggcgttcc 2940
tggcatctcc gatgtgagcg aaggggaggg tgagcgggca cccggccaca aggcttagct 3000
cagtctcgag agggggcgtt cctgaagtgg ggggagagtg attgggaggg agtgggaacc 3060
gcggagggtc ctgtgagaac ctgggattgg ccggaagggg acaaggaggg ccacaggctg 3120
cgcaagccga aagtctttct tggggacttg tgaatgggtt ggtgggtgga aagccataaa 3180
ttagagagac accctctcct tccagtattc ttctttaagt ctcagcatgc aatgtggaag 3240
cccctcaggt acctaagggt cttgatgggc tgggagctgg tggatctgag ggcacctgtc 3300
acccccagcc ctgctgtcga ttccctcagt actgtcttgg ggtgtcctgg gacctgcagg 3360
tggcactgag gagcagcagg cagagtcaga gaaggccccg agggagccct tggagcccca 3420
ggtccttcag gacgatctcc caattagcct caaaaaggtg cttcaggtga gctctcactc 3480
ccctctaata aataaacgaa tccacacacg ccccggtata gccaggtgtc tcaaagccaa 3540
agcttggctg aggagctggt gggtagagct cactgtagtg ggtctatccc aggcccagct 3600
gcctctccca ccacacccca gcacctggct tcacttatct ccctctccct ctgcacacac 3660
gtgtatctgt ctgcctcagc cccacccaac ccatccatct ccactgggga aattgtgaag 3720
ccaaacttgc tttcttcatc tcatgttgtc ggttttctca gtggggggat ttggaaagag 3780
tcaggacctt accaaacccc ccccccccac cccattctaa agctgagtca gaggaagggc 3840
tggggcttgt gctgggtcct acacggtgct tcctctctgg gcaggaagcc gagaaggggt 3900
ggctcagata ccttccttga cctccgcaca caacccccca gaacaatgct ccaggccagg 3960
cagggtttcc tggcccctcc cctgggatcc ccccaccagt gatctaattg ctggtgctct 4020
tctgtgggcc tgaggttttc tggttagaga ggctgggagt tgtggacagg tctagggagg 4080
tgacctgccc tctggtgccc acagaccagt ctgcctgagc ccctgaggat caagttggag 4140
ctggacggtg acagtcatat cctggagctg ctacagaata ggtaatagtg atggtggcaa 4200
taacagtgac cacatggcca acaacttgta tagcatttat tatgtgccag gtactaagtg 4260
cttgtgctca tttaatcctc ataacagccc tataagggat atactatcat gtattattgt 4320
cctcacttta tacatgagga agtcaaggca cagagagatt aaataacttg ccccaggtca 4380
cacagctagt atgtggtgaa aaccagattg gaattcaaat aaactaacag agtcagtggc 4440
ccaaccagta tactttgctg ccccggggtc aggagtggaa aagttggctg cgggggttgc 4500
ctggtcccca gccccacaac caccttcaag cctctgcttg tcaatgcacc gaccctggga 4560
agtggcttta gcactgcctt ctttttcttc acttcacagg ggagttggtc ccatgtccgc 4620
cccgaccctt ggggtccggc tntcccctct ccccccttcg gcgccgcccc ttcccttttc 4680
tttcttcccc tccgctttcg tccttttgcc tcccccgtgc cgttgcgcgt tccttcttcc 4740
ccgttccctc tcccctcttt tgttccctcc cgttcttttc tcccccgcgt tctttcctcc 4800
tccttttcgg tccgccctcg ccttcctccc ttccccttct gcccttcgcc ntttctccct 4860
ctcgttcttc ctcggtgtcg cgtcgtcccg gctcggcctt tccccgcttc ctcccgctcg 4920
ccgttttttt ccccccgctg tcttcccgtg ttccccttcg cttctcctct tccctttcgt 4980
tcggtcgttt tctcgttcca ttcccgcctc cccgtttccg ttccactcct tcttcctcct 5040
ttcccgctcc ccgtttctcc cgaccccaac aacaaataaa nnnnnnnnnn nnnnnnnnnn 5100
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5160
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5220
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5280
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5340
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5400
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 5460
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnntcagg aggccgagtg gaagaatcgc 5520
ttgagcccag gtaggcagag gtttcagtgg gccgagatcg agccactaca caccagcctg 5580
ggtgaaagag tgagacctcg tctcaaaaaa taaaataaaa ataaaataaa ataaaatcta 5640
gctgagacag attaggtggt ttgcccgagg ccctacaact aataaatggc ctatccattt 5700
attagttgta tttggctctt catctgtctt atgatcccat ttgcagagag ctctcacttg 5760
gttatagata atacatagtt accaatgatg aagcaatata aacccaattt cctaatttgt 5820
aaaatgaaga taataaaact acttgctgca tagagttgct gggaagatta aataagtcca 5880
tatagatgta aagtgcttaa aactatgcca gacctatggt aagtgacaag agttgttatt 5940
gggattttta aaattattat tattattatt attattattt gagacagagt ctcgctctgt 6000
ctcccaggct ggagtgcagt ggcgtgatct cggctcactg caagctccgc ctcccaggtt 6060
cacgccattc tcttgcctca gcctcccgag tagctgggac tacaggcgcc cgccactaca 6120
cccggctaat gttttgtatt ttttagtaca gacagggttt caccgtgtta tccaggatgg 6180
tctcgatctc ctgacctcat gatccacccg ccttgtcctc ccaaagtgct gagattacag 6240
gcgtgagcca ccgcacccag ctaaattact gttttttaaa aatttgaaaa aaaccactga 6300
gtttggagcc agaaaagcag gggtctactc caaccttcat tatctacttc ctggtcctcc 6360
ttggcaagtt cctgggccct ctggccttca gtggctcatc tgtaaaatgg gctcttcacc 6420
ctcctatttg acccacagag taggagtggc tgcctcttgg tcagcccggc acagctgctg 6480
gctgcgagcg gcaggtttgc ctgataattc ttcttgtcca tagtagaggc gggatgtggt 6540
aacagagacc aagactgtgg agttggtgat tgtggctgat cactcggagg tgagcctgct 6600
ggcccctgca catcctcctc cccctgcact gccctgccgc ctttcatgtc acctctcttg 6660
gcctacaggc ccagaaatac cgggacttcc agcacctgct aaaccgcaca ctggaagtgg 6720
ccctcttgct ggacacagtg agtgctggac agggcaaccc ccaccccagg cccctgacca 6780
tggcaacccc tcttctgagc cccagctgtc tttcagttct tccggcccct gaatgtacga 6840
gtggcactag tgggcctgga ggcctggacc cagcgtgacc tggtggagat cagcccaaac 6900
ccagctgtca ccctcgaaaa cttcctccac tggcgcaggg cacatttgct gcctcgattg 6960
ccccatgaca gtgcccagct ggtgacgtaa gggccccaga ctcagccaga gaggccagtc 7020
ctgtcctggc caaattcaca ccccttcagc accctacctc agcccctgaa gctctgacca 7080
ccgtggcttc tggccctgaa ctttagcctc tctgtcccac agtggtactt cattctctgg 7140
gcctacggtg ggcatggcca ttcagaactc catctgttct cctgacttct caggaggtgt 7200
gaacatggtg agttatttcc aggtctcctc ctcattccca attcagttcc tcccaagtgt 7260
ggtggcattt atgcactgaa acccccctat aaagttgccc aaccccaaag ctacaggtat 7320
agagggtgga ggtacgtgat gtggcctttg ctatcaggga gccctcgctt atggccagct 7380
agtcacagtg tacacagtca tcccctgtgc agtcttccca tttcttagag gagggtagga 7440
ggcagctaag gcccaaagaa cagaggtgat ctccctccag tgagggaggg ggacagagct 7500
gagctagaac ccaagtttct gccatccagg cctgggttct cctactttag aagcaattca 7560
ggagggaagc agtgcctgct gagtgcccac gaggtcagac gtggagggaa caggagcaga 7620
gagggtggtc tgggcattgt ggtggaggca ggctgggact ggacctacag tacccctccc 7680
caatgacagg accactccac cagcatcctg ggagtcgcct cctccatagc ccatgagttg 7740
ggccacagcc tgggcctgga ccatgatttg cctgggaata gctgcccctg tccaggtcca 7800
gccccagcca agacctgcat catggaggcc tccacagagt aagtagctgc aggatggaga 7860
gagggtgtgg ggcagggggc agggannnnn nnnnnnnnnn nnnnnnnnnn tgttagagtt 7920
accttccttg ccaccctccc cagcttccta ccaggcctga acttcagcaa ctgcagccga 7980
cgggccctgg agaaagccct cctggatgga atgggcagct gcctcttcga acggctgcct 8040
agcctacccc ctatggctgc tttctgcgga aatatgtttg tggagccggg cgagcagtgt 8100
gactgtggct tcctggatgt gagccccttt cccaaagcct cgccccactc acttctgtac 8160
cctcaccctg gctcattagc cctatcccag cctcctgagc tcttgggttc tgaagggact 8220
ttccacccct ctcctacttg ccctgtctgt ggggacagca catgggttgt tgggctctag 8280
ccctcgcttg ctgtgtagct tctggtcttg gcctgtggga ggaggagaga ttggagggag 8340
gctcacaggc cccacctgct ctgatgcccg gcccccgtgc tcctgcccac aggactgcgt 8400
cgatccctgc tgtgattctt tgacctgcca gctgaggcca ggtgcacagt gtgcatctga 8460
cggaccctgt tgtcaaaatt gccaggtggg tagagactag actggccacc cggagctcac 8520
ctgccggggc caaggtggaa agggtcattc tgacccccgg ctggatttgc tcagtgccca 8580
cactgatgct catccaccct ccacagctgc gcccgtctgg ctggcagtgt cgtcctacca 8640
gaggggattg tgacttgcct gaattctgcc caggagacag ctcccagtgt ccccctgatg 8700
tcagcctagg ggatggcgag ccctgcgctg gcgggcaagc tgtgtgcatg cacgggcgtt 8760
gtgcctccta tgcccagcag tgccagtcac tttggggacc tggagcccag cccgctgcgc 8820
cactttgcct ccagaccgct aatactcggg gaaatgcttt tgggagctgt gggcgcaacc 8880
ccagtggcag ttatgtgtcc tgcaccccta ggtaagtgag gaaacctggc tcctcctttg 8940
ggtttctgag agccttggcc ctgctcctac taactctgtg tgcccttccc cctcnnnnnn 9000
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnttacgg catttgtagt 9060
tactcacact tttgccttca nacagctaat actcggggaa atgcttttgg gagctgtggg 9120
cgcaacccca gtggcagtta tgtgtcctgc acccctaggt aagtgaggaa acctggctcc 9180
tcctttgggt ttctgagagc cttggccctg ctcctactaa ctctgtgtgc ccttccccct 9240
ccccacagag atgccatttg tgggcagctc cagtgccaga caggtaggac ccagcctctg 9300
ctgggctcca tccgggatct actctgggag acaatagatg tgaatgggac tgagctgaac 9360
tgcagctggg tgcacctgga cctgggcagt gatgtggccc agcccctcct gactctgcct 9420
ggcacagcct gtggccctgg cctggtgagc agcctgggtg ggcaagacca ggtgtgagaa 9480
gggacatttg gaccacaatg aacagagccc agacttcacc attcaccaat gtcaaaggca 9540
gggactccaa gggaagtcag tttcttactt cagatggagc aaagtcctat caactcacta 9600
tgccttggtt tccccatctg taaacgcagg gtatggcctc aaccttattg gcctcccagt 9660
cccattaaag ctttgtggga atctgatcca ggctcttctc tccctgggtc aggtgtgtat 9720
agaccatcga tgccagcgtg tggatctcct gggggcacag gaatgtcgaa gcaaatgcca 9780
tggacatggg gtgagctggg atgggggaag tggaagggga gcagagagcc tctagagagg 9840
aaaaggatac tgggctttgg aaatagacat atctgggttt taatccttgc tctactactt 9900
cccagttgtg tgacctcggg caggttacta actttgctga gctcagtttc cccacctatc 9960
aaatggctat aataatagta tccccatcca gggtacatga gatgtgtatg caagcaagta 10020
gcacagtggg taactaatag tgcttttaaa aannnnnnnn nnnnnnnnnn nnnnnnnnnn 10080
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10140
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10200
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10260
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10320
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10380
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10440
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10500
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10560
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10620
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10680
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10740
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10800
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10860
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10920
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10980
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11040
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11100
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11160
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11220
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11280
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11340
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11400
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 11460
gaggttccgc tgcctccaag ccggacttag ggcttcaaga ggcgggcgtg ccctctggag 14940
tcccctacca tgactgaagg cgccagagac tggcggtgtc ttaagactcc gggcaccgcc 15000
acgcgctgtc aagcaacact ctgcggacct gccggcgtag ttgcagcggg ggcttgggga 15060
ggggctgggg gttggacggg attgaggaag gtccgcacag cctgtctctg ctcagttgca 15120
ataaacgtga catcttggga gcgttcccca gagtttgtct gcttctagaa cccgggtcgc 15180
tcctgctgcg gttccaggtt tggccgccag aagacgctgc cgcctcagac gagggcgggc 15240
tgtgtggggc gggagtacca gaaagggtcg gcgtgtgtcc ccgggatgct cgcagcttcc 15300
ctctgcccag actggggtgg ctttcggcgc aatctgtcaa gctgttggac ctgccgtccc 15360
cactctgacc attggctggg aaaagtggat ctggctgatg ctcccagagc ccaggagcca 15420
gggcggagcg gggcggcggc tgctcccacg atcccaaggc cgcgcacctg cctcctcccc 15480
ctccgccgcc gccacttgag ggatcgggaa caaaggtgct ttgtacaggc cgcaaccacc 15540
tcattacttc gtcttaggga ctggggccgc gtgggccccc agcccggaac gaaggtgtgg 15600
agcggcaagg gacagacgcc aatcttaaag tgagcatcta gcgcgccacc taaggctctt 15660
tagggaaggt ggtcccagag ctgtgttgtc ccttccgctt gcactgtccc tagatgtgca 15720
aagaaaacgg ggcagtgcat gaaggtggtt ggacaggctt catggatcct cgcccgcgcc 15780
tcactttccc ctatctgggc aaaggttatg tacccttatt taaaatcttc caaacttcta 15840
ataaggcagt ctaccctgca ctaaagcaga cacgaaagag atgacctccc taaaaatact 15900
gctgttggaa tacgtccttc cttcccgccc cctcgcagtg cggtgcagcc tcagtggaag 15960
ctttggcgaa cctggcgcgc gctgcggtgc acagagggtt aactggagtt ggcgctgggt 16020
ggagaggagg agacgcgctc ccattggcgg aaagttattc aggggcgggg tcagtgaatc 16080
tccgtacccc actccccttt ccgcaacttc cctcttcact ttgtaccttt ctctcctcga 16140
ctgtgaagcg ggccgggacc tgccaggcca gaccaaaccg gacctcgggg gcgatgcggc 16200
tgctgcccct gctgcggact gtcctatggg ccgcgtcctc ggctcccctc tgcgcggggg 16260
ctccagcctc cgccacgtag tctactggaa ctccagtaac cccaggtagc cgggccgaac 16320
cgggcgagcg cacagccaag tctgcgcgct cccgggcttt gcgcgcgccc gccacccgct 16380
ctttgcgcgg cgccgcctga gcctggccgc gcgccggggc tcctttgttt gagccggcgg 16440
gggagggggg aggggcgagg ggcgaggcgc gccctgggtc tccccacagc ccgcatgtgt 16500
tggggggcag gcagaagacc ccagccccaa gggttgtcta gggggtcttg gagcatggag 16560
ctgggggggc ctttgcccgc actccgggct ccgcccccct cgctgctctc ctggcgatcc 16620
ccagcctccc gcaggctgga gctgtggctg acgaacttga gagcgaggga gggggcttta 16680
ctcttatgaa agagcgtggg ttactctcct gcccgctggg tctcacctct ggctctcact 16740
ctgtctcctg atctcatttg ctatctctgc tttcatctct gtctttattg gtccttctgt 16800
ttctttccag tgtcagccct gcccttctag ccgaatcacc tctgggcaag tctcgtgacc 16860
ttcctaacct catttatctc acctgtataa tgggctaata atacctagta ccctgggaag 16920
tctggcaggg taagtgaggt catgtatgtg aaagaggctc aggctgtaca gatataaact 16980
attatttctt tctctctcct gagctgcctg cctttgaacc ttagtatatt ttactgtttc 17040
catccccctc cccaagtctc cctgcctctc ctatttccta tctgtttttc tttctgattt 17100
tctacttgag acaatctgtg actattcatt tcttcact 17138



<210> 4 
<211> 814 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> VARIANT 

<222> (1)...(814)

<223> Xaa = Any Amino Acid 



<400> 4

Met Arg Leu Ala Leu Leu Trp Ala Leu Gly Leu Leu Gly Ala Gly Ser

1 5 10 15

Pro Leu Pro Ser Trp Pro Leu Pro Asn Ile Gly Gly Thr Glu Glu Gln

20 25 30

Gln Ala Glu Ser Glu Lys Ala Pro Arg Glu Pro Leu Glu Pro Gln Val

35 40 45

Leu Gln Asp Asp Leu Pro Ile Ser Leu Lys Lys Val Leu Gln Thr Ser

50 55 60

Leu Pro Glu Pro Leu Arg Ile Lys Leu Glu Leu Asp Gly Asp Ser His
65 70 75 80

Ile Leu Glu Leu Leu Gln Asn Arg Glu Leu Val Pro Gly Arg Pro Thr

85 90 95

Leu Val Trp Tyr Gln Pro Asp Gly Thr Arg Val Val Ser Glu Gly His

100 105 110

Thr Leu Glu Asn Cys Cys Tyr Gln Gly Arg Val Arg Gly Tyr Ala Gly

115 120 125

Ser Trp Val Ser Ile Cys Thr Cys Ser Gly Leu Arg Gly Leu Val Val

130 135 140

Leu Thr Pro Glu Arg Ser Tyr Thr Leu Glu Gln Gly Pro Gly Asp Leu
145 150 155 160

Gln Gly Pro Pro Ile Ile Ser Arg Ile Gln Asp Leu His Leu Pro Gly

165 170 175

His Thr Cys Ala Leu Ser Trp Arg Glu Ser Val His Thr Gln Thr Pro

180 185 190

Pro Glu His Pro Leu Gly Gln Arg His Ile Arg Arg Arg Arg Asp Val

195 200 205

Val Thr Glu Thr Lys Thr Val Glu Leu Val Ile Val Ala Asp His Ser

210 215 220

Glu Ala Gln Lys Tyr Arg Asp Phe Gln His Leu Leu Asn Arg Thr Leu
225 230 235 240

Glu Val Ala Leu Leu Leu Asp Thr Phe Phe Arg Pro Leu Asn Val Arg

245 250 255

Val Ala Leu Val Gly Leu Glu Ala Trp Thr Gln Arg Asp Leu Val Glu

260 265 270

Ile Ser Pro Asn Pro Ala Val Thr Leu Glu Asn Phe Leu His Trp Arg

275 280 285

Arg Ala His Leu Leu Pro Arg Leu Pro His Asp Ser Ala Gln Leu Val

290 295 300

Thr Gly Thr Ser Phe Ser Gly Pro Thr Val Gly Met Ala Ile Gln Asn
305 310 315 320

Ser Ile Cys Ser Pro Asp Phe Ser Gly Gly Val Asn Met Asp His Ser

325 330 335

Thr Ser Ile Leu Gly Val Ala Ser Ser Ile Ala His Glu Leu Gly His

340 345 350

Ser Leu Gly Leu Asp His Asp Leu Pro Gly Asn Ser Cys Pro Cys Pro

355 360 365

Gly Pro Ala Pro Ala Lys Thr Cys Ile Met Glu Ala Ser Thr Asp Phe

370 375 380

Leu Pro Gly Leu Asn Phe Ser Asn Cys Ser Arg Arg Ala Leu Glu Lys
385 390 395 400

Ala Leu Leu Asp Gly Met Gly Ser Cys Leu Phe Glu Arg Leu Pro Ser

405 410 415

Leu Pro Pro Met Ala Ala Phe Cys Gly Asn Met Phe Val Glu Pro Gly

420 425 430

Glu Gln Cys Asp Cys Gly Phe Leu Asp Asp Cys Val Asp Pro Cys Cys

435 440 445

Asp Ser Leu Thr Cys Gln Leu Arg Pro Gly Ala Gln Cys Ala Ser Asp

450 455 460

Gly Pro Cys Cys Gln Asn Cys Gln Leu Arg Pro Ser Gly Trp Gln Cys
465 470 475 480

Arg Pro Thr Arg Gly Asp Cys Asp Leu Pro Glu Phe Cys Pro Gly Asp

485 490 495 

Ser Ser Gln Cys Pro Pro Asp Val Ser Leu Gly Asp Gly Glu Pro Cys

500 505 510

Ala Gly Gly Gln Ala Val Cys Met His Gly Arg Cys Ala Ser Tyr Ala

515 520 525

Gln Gln Cys Gln Ser Leu Trp Gly Pro Gly Ala Gln Pro Ala Ala Pro

530 535 540

Leu Cys Leu Gln Thr Ala Asn Thr Arg Gly Asn Ala Phe Gly Ser Cys
545 550 555 560

Gly Arg Asn Pro Ser Gly Ser Tyr Val Ser Cys Thr Pro Arg Asp Ala

565 570 575

Ile Cys Gly Gln Leu Gln Cys Gln Thr Gly Arg Thr Gln Pro Leu Leu

580 585 590

Gly Ser Ile Arg Asp Leu Leu Trp Glu Thr Ile Asp Val Asn Gly Thr

595 600 605

Glu Leu Asn Cys Ser Trp Val His Leu Asp Leu Gly Ser Asp Val Ala

610 615 620

Gln Pro Leu Leu Thr Leu Pro Gly Thr Ala Cys Gly Pro Gly Leu Val
625 630 635 640

Cys Ile Asp His Arg Cys Gln Arg Val Asp Leu Leu Gly Ala Gln Glu

645 650 655

Cys Arg Ser Lys Cys His Gly His Gly Val Cys Asp Ser Asn Arg His

660 665 670

Cys Tyr Cys Glu Glu Gly Trp Ala Pro Pro Asp Cys Thr Thr Gln Leu

675 680 685

Lys Ala Thr Ser Ser Leu Thr Thr Gly Leu Leu Leu Ser Leu Leu Val

690 695 700

Leu Leu Val Leu Val Met Leu Gly Ala Ser Tyr Trp Tyr Arg Ala Arg
705 710 715 720

Leu Xaa Gln Arg Leu Cys Gln Leu Lys Gly Pro Thr Cys Gln Tyr Arg

725 730 735

Ala Ala Gln Ser Gly Pro Ser Glu Arg Pro Gly Pro Pro Gln Arg Ala

740 745 750

Leu Leu Ala Arg Gly Thr Lys Ser Gln Gly Pro Ala Lys Pro Pro Pro

755 760 765

Pro Arg Lys Pro Leu Pro Ala Asp Pro Gln Gly Arg Cys Pro Ser Gly

770 775 780

Asp Leu Pro Gly Pro Gly Pro Gly Ile Pro Pro Leu Val Val Pro Ser
785 790 795 800

Arg Pro Ala Pro Pro Pro Pro Thr Val Ser Ser Leu Tyr Leu
805 810